A Unification-Based Parser for Relational Grammar

نویسندگان

  • David E. Johnson
  • Adam Meyers
  • Lawrence S. Moss
چکیده

We present an implemented unification-based parser for relational grammars developed within the s t ra t i f ied f ea tu re g r a m m a r (SFG) framework, which generalizes Kasper-Rounds logic to handle relational grammar analyses. We first introduce the key aspects of SFG and a lexicalized, graph-based variant of the framework suitable for implementing relational grammars. We then describe a head-driven chart parser for lexicalized SFG. The basic parsing operation is essentially ordinary feature-structure unification augmented with an operation of label unification to build the stratified features characteristic of SFG. I N T R O D U C T I O N Although the impact of relational grammar (RG) on theoretical linguistics has been substantial, it has never previously been put in a form suitable for computational use. RG's multiple syntactic strata would seem to preclude its use in the kind of monotonic, unification-based parsing system many now consider standard ([1], [11]). However, recent work by Johnson and Moss [2] on a Kasper-Rounds (KR) style logic-based formalism [5] for RG, called St ra t i f ied Fea ture G r a m m a r (S FG), has demonstrated that even RG's multiple strata are amenable to a feature-structure treatment. Based on this work, we have developed a unification-based, chart parser for a lexical version of SFG suitable for building computational relational grammars. A lexicalized SFG is simply a collection of s t ra t i f ied fea tu re graphs (Sgraphs) , each of which is anchored to a lexical item, analogous to lexicalized TAGs [10]. The basic parsing operation of the system is S-graph unif icat ion (S-unification): This is essentially ordinary feature-structure unification augmented with an operation of label unification to build the stratified features characteristic of SFG. R E L A T E D W O R K Rounds and Manaster-Ramer [9] suggested encoding multiple strata in terms of a "level" attribute, using path equations to state correspondences across strata. Unfortunately, "unchanged' relations in a stratum must be explicitly "carried over" via path equations to the next stratum. Even worse, these "carry over" equations vary from case to case. SFG avoids this problem. S T R A T I F I E D F E A T U R E G R A M M A R SFG's key innovation is the generalization of the concept ]eature to a sequence of so-called rela t ional signs (R-signs). The interpretation of a s t ra t i f ied fea ture is that each R-sign in a sequence denotes a primitive relation in different strata. 1 For instance, in Joe gave Mary tea there are, at the clause level, four sister arcs (arcs with the same source node), as shown in Figure h one arc labeled [HI with target gave, indicating gave is the head of the clause; one with label [1] and target Joe, indicating Joe is both the predicateargument, and surface subject, of the clause; one with label [3,2] and target Mary, indicating that l We use the following R-signs: 1 (subject), 2 (direct object), 3 (indirect object), 8 (chSmeur), Cat (Category), C (comp), F (flag), H (head), LOC (locative), M (marked), as well as the special Null R-signs 0 and/, explainedbelow.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Generalized Probabilistic LR Parsing of Natural Language (Corpora) with Unification-Based Grammars

We describe work toward the construction of a very wide-coverage probabilistic parsing system for natural language (NL), based on LR parsing techniques. The system is intended to rank the large number of syntactic analyses produced by NL grammars according to the frequency of occurrence of the individual rules deployed in each analysis. We discuss a fully automatic procedure for constructing an...

متن کامل

A Corpus-based Probabilistic Unification Grammar

This paper describes an attempt at creating a robust parsing system for the SCHISMA task domain. We describe how a probabilistic unification grammar is generated from a corpus of utterances collected in Wizard of Oz experiments. We tagged this corpus with syntactic categories and superficial structure using Standard Generalised Markup Language (SGML). From the annotated data thus obtained a pro...

متن کامل

Studying impressive parameters on the performance of Persian probabilistic context free grammar parser

In linguistics, a tree bank is a parsed text corpus that annotates syntactic or semantic sentence structure. The exploitation of tree bank data has been important ever since the first large-scale tree bank, The Penn Treebank, was published. However, although originating in computational linguistics, the value of tree bank is becoming more widely appreciated in linguistics research as a whole. F...

متن کامل

Feature Engineering in Persian Dependency Parser

Dependency parser is one of the most important fundamental tools in the natural language processing, which extracts structure of sentences and determines the relations between words based on the dependency grammar. The dependency parser is proper for free order languages, such as Persian. In this paper, data-driven dependency parser has been developed with the help of phrase-structure parser fo...

متن کامل

Backbone Extraction And Pruning For Speeding Up A Deep Parser For Dialogue Systems

In this paper we discuss issues related to speeding up parsing with wide-coverage unification grammars. We demonstrate that state-of-the-art optimisation techniques based on backbone parsing before unification do not provide a general solution, because they depend on specific properties of the grammar formalism that do not hold for all unification based grammars. As an alternative, we describe ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1993